HITS vs. Non-negative Matrix Factorization
نویسندگان
چکیده
Ranking algorithms have been widely used for web and other networks to infer quality/popularity. Both PageRank and HITS were developed for ranking web pages from a web reference graph. Nevertheless, these algorithms have also been applied extensively for a variety of other applications such as question-answer services, author-paper graphs, and others where a graph can be deduced from the data set. The intuition behind HITS has been explained in terms of hubs and spokes as two values are inferred for each node. HITS has also been used extensively for ranking in other applications although it is not clear whether the same intuition carries over. It would be beneficial if we can understand these algorithms mathematically in a general manner so that the results can be interpreted and understood better for different applications. This paper provides such as understanding for applying HITS algorithm to other applications. In this paper, we generalize the graph semantics in terms two underlying concepts: in-link probability (ILP) and out-link probability (OLP). Using these two, the rank scores of nodes in a graph are computed. We propose the standard non-negative matrix factorization (NMF) approach to calculate ILP and OLP vectors. We also establish a relationship between HITS vectors and ILP/OLP vectors which enables us to better understand HITS vectors associated with any graph in terms of these two probabilities. Finally, we illustrate the versatility of our approach using different graph types (representing different application areas) and validate the results. This work provides an alternative way of understanding HITS algorithm for a variety of applications.
منابع مشابه
Iterative Weighted Non-smooth Non-negative Matrix Factorization for Face Recognition
Non-negative Matrix Factorization (NMF) is a part-based image representation method. It comes from the intuitive idea that entire face image can be constructed by combining several parts. In this paper, we propose a framework for face recognition by finding localized, part-based representations, denoted “Iterative weighted non-smooth non-negative matrix factorization” (IWNS-NMF). A new cost fun...
متن کاملA new approach for building recommender system using non negative matrix factorization method
Nonnegative Matrix Factorization is a new approach to reduce data dimensions. In this method, by applying the nonnegativity of the matrix data, the matrix is decomposed into components that are more interrelated and divide the data into sections where the data in these sections have a specific relationship. In this paper, we use the nonnegative matrix factorization to decompose the user ratin...
متن کاملVoice-based Age and Gender Recognition using Training Generative Sparse Model
Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...
متن کاملA New Algorithm for Community Identification in Linked Data
In this paper, we propose four specifications which can be used for the evaluation of community identification algorithms. Furthermore, a novel algorithm VHITS meeting the four established specifications is presented. Basically, VHITS is based on a two-step approach. In the first step, the Nonnegative Matrix Factorization is used to estimate the community memberships. In the second step, a voti...
متن کاملInstantaneous vs. Convolutive Non-Negative Matrix Factorization: Models, Algorithms and Applications to Audio Pattern Separation
Since the seminal paper published in 1999 by Lee and Seung, non-negative matrix factorization (NMF) has attracted tremendous research interests over the last decade. The earliest work in NMF is perhaps by (Paatero, 1997) and is then made popular by Lee and Seung due to their elegant multiplicative algorithms (Lee & Seung, 1999, Lee & Seung, 2001). The aim of NMF is to look for latent structures...
متن کامل